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Abstract — Chaos-based approaches are frequently proposed 
in information hiding, but without obvious justification. Indeed, 
the reason why chaos is useful to tackle with discretion, 
robustness, or security, is rarely elucidated. This research work 
presents a new class of non-blind information hiding algorithms 
based on some finite domains iterations that are Devaney's 
topologically chaotic. The approach is entirely formalized and 
reasons to take place into the mathematical theory of chaos 
are explained. Finally, stego-security and chaos security are 
consequently proven for a large class of algorithms. 

I. Introduction 

Chaos-based approaches are frequently proposed to im- 
prove the quality of schemes in information hiding Hi- 
ll?]. In these works, the understanding of chaotic systems 
is almost intuitive: a kind of noise-like spread system 
with sensitive dependence on initial condition. Practically, 
some well-known chaotic maps are used either in the data 
encryption stage J2], 0, in the embedding into the carrier 
medium, or in both |fl~), 0. 

This work focus on non-blind binary information hiding 
scheme: the original host is required to extract the binary 
hidden information. This context is indeed not as restrictive 
as it could primarily appear. Firstly, it allows to prove the 
authenticity of a document sent through the Internet (the 
original document is stored whereas the stego content is 
sent). Secondly, Alice and Bob can establish an hidden 
channel into a streaming video (Alice and Bob both have 
the same movie, and Alice hide information into the frame 
number k iff the binary digit number k of its hidden 
message is 1). Thirdly, based on a similar idea, a same 
given image can be marked several times by using various 
secret parameters owned both by Alice and Bob. Thus more 
than one bit can be embedded into a given image by using 
dhCI dissimulation. Lastly, non-blind watermarking is useful 
in network's anonymity and intrusion detection [6|, and to 
protect digital data sending through the Internet Q. 

Methods referenced above are almost based on two fun- 
damental chaotic maps, namely the Chebychev and logistic 
maps, which range in BL To avoid justifying that functions 
which are chaotic in K still remain chaotic in the computing 
representation (i.e., floating numbers) we argue that func- 
tions should be iterated on finite domains. Boolean discrete- 
time dynamical systems (BS) are thus iterated. 

* Authors in alphabetic order 



Furthermore, previously referenced works often focus on 
discretion and/or robustness properties, but they do not 
consider security. As far as we know, stego-security (8] and 
chaos-security have only been proven on the spread spectrum 
watermarking J9), and on the dhCI algorithm |[T0l . which is 
notably based on iterating the negation function. We argue 
that other functions can provide algorithms as secure as the 
dhCI one. This work generalizes thus this latter algorithm 
and formalizes all its stages. Due to this formalization, we 
address the proofs of the two security properties for a large 
class of steganography approaches. 

This research work is organized as follows. Section [TT] 
first recalls the BS context. The new class of algorithms, 
which is the first contribution, is firstly introduced in Sec. [HI] 
Section [IV] shows how secure is our approach: this is 
the second contribution of the present paper. Instances of 
algorithms guaranteeing that desired properties are presented 
in Sec. [V] Discussion, conclusive remarks, and perspectives 
are given in the final section. 

II. Boolean Discrete-Time Dynamical Systems 

In this section, we first give some recalls on Boolean 
discrete dynamical Systems (BS). With this material, next 
sections formalize the information hiding algorithms based 
on chaotic iterations. 

Let n be a positive integer. A Boolean discrete-time 
network is a discrete dynamical system defined from a 
Boolean map f : B" — > B" s.t. 

x = (x!,...,x„) i y f{x) = 

and an iteration scheme {e.g., parallel, serial, asyn- 
chronous. . . ). With the parallel iteration scheme, the dynam- 
ics of the system are described by x t+l = /(x*) where 
x a e B™. Let thus Ff : [1; n] x B" to B" be defined by 

Ff(i,x) = (xi, . . . ,Xi-i, fi(x),x i+ i, . . . ,x n ), 

with the asynchronous scheme, the dynamics of the system 
are described by x t+1 = -F/(st, x*) where x° £ B" and 
s is a strategy, i.e., a sequence in [l;n] N . Notice that this 
scheme only modifies one element at each iteration. 
Let Gf be the map from [1; nf 1 x B" to itself s.t. 

G f (s,x) = (a(s),Ff{s Q ,x)), 

where a(s) t — s t +i for all t in N. Notice that parallel 
iteration of Gf from an initial point X° = (s,x°) describes 



the "same dynamics" as the asynchronous iteration of / 
induced by the initial point x° and the strategy s. 

Finally, let / be a map from B™ to itself. The asyn- 
chronous iteration graph associated with / is the directed 
graph r(/) defined by: the set of vertices is B"; for all 
x G B" and i G r(/) contains an arc from x to 

F f (i,x). 

III. Formalization of Steganographic Methods 

The data hiding scheme presented here does not constrain 
media to have a constant size. It is indeed sufficient to 
provide a function and a strategy that may be parametrized 
with the size of the elements to modify. The mode and the 
strategy-adapter defined below achieve this goal. 

Definition 1 (Mode) A map f, which associates to any n G 
IN an application f n : B" — > B™, is called a mode. 

For instance, the negation mode is defined by the map that 
assigns to every integer n G IN* the function -i n : B™ — ► 

B n ,-i„(a;i, . . . ,x n ) i-> (xj,...,x^). 

Definition 2 (Strategy-Adapter) A strategy-adapter is a 
function S, from N to the set of integer sequences, that 
associates to n a sequence S G [1,71]^. 

Intuitively, a strategy-adapter aims at generating a strategy 
(5*)* eN where each term S* belongs to [1, n]. 

Let us notice that the terms of x that may be replaced by 
terms issued from y are less important than other: they could 
be changed without be perceived as such. More generally, a 
signification function attaches a weight to each term defining 
a digital media, w.r.t. its position t: 

Definition 3 (Signification function) A signification func- 
tion is a real sequence (u fe ) feeN . 

For instance, let us consider a set of grayscale images 
stored into 8 bits gray levels. In that context, we consider 
u k = 8 — (k mod 8) to be the fc-th term of a signification 
function (u fe ) fc6N . 

Definition 4 (Significance of coefficients) Let (u k ) k£N be 

a signification function, m and M be two reals s.t. m < M. 
Then the most significant coefficients (MSCs) of x is the 
finite vector um, the least significant coefficients (LSCs) of 
x is the finite vector u m , and the passive coefficients of x 
is the finite vector u p such that: 

um — (k | k G IN and u k > M and k <| x |) 
u m = {k | k G IN and u k ^ m and k ^ x ) 
Up = (fc | k G IN and u k G]m; M[ and k ^| x |) 

For a given host content x, MSCs are then ranks of 
x that describe the relevant part of the image, whereas 
LSCs translate its less significant parts. We are then ready 
to decompose an host x into its coefficients and then to 
recompose it. Next definitions formalize these two steps. 



Definition 5 (Decomposition function) Let (u fc ) fceN be a 

signification function, 58 the set of finite binary sequences, 
yi the set of finite integer sequences, m and M be two reals 
s.t. m < M. Any host x may be decomposed into 

(u M , u m , u p , <f> M , cf) m , 4> p ) 6 01 X OT; X 9t x <B X 58 X 93 

where 

• Um, U m , and u p are coefficients defined in Definition 

m 

• 4>m = {x u ™ , x u m , . . . , x ul "'"' ' ^ ; 
. m = (x<,x<,...,x u % m ^; 

. ^ = (x u p,x u2 p, . . . ,x u " pi y 

The function that associates the decomposed host to any 
digital host is the decomposition function. It is further 
referred as dec(u, m, M) since it is parametrized by u, m 
and M. Notice that u is a shortcut for (it fc ) fceN . 

Definition 6 (Recomposition) Let 

(uM,u m , u p , <f>M,<t>m, 4>p) G9tx91x9Ix*8x23xQ3 s.t. 

• the sets of elements in um, elements in u m , and 
elements in u p are a partition of [1, n\; 

< \u M \ = \<Pm\, \u m \ = \(f m \, and \u p \ = \<p p \. 
One may associate the vector 

\um\ \u m \ \u p \ 

X = H fM^ui, + ^n- e u^ + ^ P - e K 
i—1 i—1 i—1 

where (e,:)ig]N is the usual basis of the R— vectorial 
space (R ,+,.). The function that associates x to any 
(um ,Um, Up, (j)M j 4>m, 4>p) following the above constraints 
is called the recomposition function. 

The embedding consists in the replacement of the values 
of 4> m of x's LSCs by y. It then composes the two decom- 
position and recomposition functions seen previously. More 
formally: 

Definition 7 (Embedding media) Let dec(u, m, M) 
be a decomposition function, x be a host content, 
(um, u m , Up, <j)M, 4> m , 4*p) be its image by dec(u,m,M), 
and y be a digital media of size \u m \. The digital media 
z resulting on the embedding of y into x is the image of 
(uM 1 u m ,Up,(j)Miyi4>p) by the recomposition function rec. 

Let us then define the dhCI information hiding scheme: 

Definition 8 (Data hiding dhCI) Let dec(u, m, M) 

be a decomposition function, f be a mode, S 
be a strategy adapter, x be an host content, 
(uM,u m , Up, 4>m, 4>mi 0p) be its image by dec(u, m, M), q 
be a positive natural number, and y be a digital media of 
size I = \u m \. 

The dhCI dissimulation maps any (x, y) to the digital 
media z resulting on the embedding of y into x, s.t. 



• We instantiate the mode f with parameter I = \u m \, 
leading to the function fi : B' — > B ; . 

• We instantiate the strategy adapter S with parameter 
y (and some other ones eventually). This instantiation 
leads to the strategy S y £ [1; ZJ N . 

• We iterate Gf t with initial configuration (S y ,cj) m ). 

• y is the q-th term. 

To summarize, iterations are realized on the LSCs of 
the host content (the mode gives the iterate function, the 
strategy-adapter gives its strategy), and the last computed 
configuration is re-injected into the host content, in place of 
the former LSCs. 
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Figure 1. The dhCI dissimulation scheme 



We are then left to show how to formally check whether 
a given digital media z results from the dissimulation of y 
into the digital media x. 

Definition 9 (Marked content) Let dec{u, m, M) be a de- 
composition function, f be a mode, S be a strategy adapter, 
q be a positive natural number, and y be a digital media, 

(um, u mi Up, (pM, (ftm, 4>p) be the image by dec(u, m, M) of 
a digital media x. Then z is marked with y if the image 
by dec(u, m, M) of z is (uM,u m , u p , <pM,y, 4>p) where y is 
the right member of (S y , <f> m ). 



Various decision strategies are obviously possible to deter- 
mine whether a given image z is marked or not, depending 
on the eventuality that the considered image may have been 
attacked. For example, a similarity percentage between x 
and z can be computed, and the result can be compared to 
a given threshold. Other possibilities are the use of ROC 
curves or the definition of a null hypothesis problem. The 
next section recalls some security properties and shows how 
the dhCI dissimulation algorithm verifies them. 

IV. Security Analysis 

Stego-security [8j is the highest security class in 
Watermark-Only Attack setup. Let K be the set of embed- 
ding keys, p(X) the probabilistic model of No initial host 
contents, and p(Y \K) the probabilistic model of No marked 
contents s.t. each host content has been marked with the 
same key K and the same embedding function. 

Definition 10 (Stego-Security [8|) The embedding func- 
tion is stego-secure if \/K G M.,p(Y\K) = p(X) is 
established. 

Let us prove that, 

Theorem 1 Let e be positive, I be any size of LSCs, 
X - U (»'), fi be an image mode s.t. T(fi) is strongly 
connected and the Markov matrix associated to fi is doubly 
stochastic. In the instantiated dhCI dissimulation algorithm 
with any uniformly distributed (u.d.) strategy-adapter which 
is independent from X, there exists some positive natural 
number q s.t. \p(X q ) — p(X)\ < e. 

Proof: Let deci be the bijection between B' and 
[0, 2 l — 1] that associates the decimal value of any bi- 
nary number in B'. The probability piX 1 ) = (p(X* = 
eo),...,p(X t = G2 l -i)) f° r e j £ 13' i s tr| us equal to 
(pldeciiX*) = 0, . . . ^(deciiX 1 ) = 2 l - 1)) further denoted 
by 7T*. Let i g [0, 2 l - 1], the probability p(deci(X t+1 ) = i) 
is 

2 ! -l I 

^pideciiX*) = j, = k, % = k j, f k (j) = i fc ) 

j=0 k=l 

where i =& j is true iff the binary representations of i and j 
may only differ for the fc-th element, and where abusively 
denotes the fc-th element of the binary representation of i. 
Next, due to the proposition's hypotheses on the strategy, 

p(deci{X l ) = j,S l = k,i =k j,fk(j) = ifc) is equal to 
y.p(deripf *) = j, i = k j, fk{j) = ik)- Finally, since i = k j 
are fk(j) = «fe are constant during the iterative process and 
thus does not depend on X 1 , we have 



r t+i 
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Since j £ P(« =fc ij /fe(j) = *fc) is e q ual t0 ^ where 

fe=i 

M is the Markov matrix associated to /; we thus have 

2 ! -l 

tt| +1 = ^ 7r*-.M 3 -i and thus 7r t+1 = tt*M. 

First of all, since the graph is strongly connected, 
then for all vertices i and j, a path can be found to reach j 
from i in at most 2 l steps. There exists thus fcy G Jl, 2'J s.t. 
M*^ > 0. As all the multiples I x fc.^ of fe^ are such that 

tx k 

we can conclude that, if fc is the least common 
multiple of {kij/i,j G [1, 2'J} thus Vi, j G [1,2 ! ],M^ > 
and thus A/ is a regular stochastic matrix. 

Let us now recall the following stochastic matrix theorem: 

Theorem 2 (Stochastic Matrix) If M is a regular stochas- 
tic matrix, then M has an unique stationary probability 
vector ir. Moreover, if 7r° is any initial probability vector 
and 7r* +1 = 7r'.M for t = 0, 1, . . . then the Markov chain 
7T converges to n as t tends to infinity. 

Thanks to this theorem, M has an unique stationary 
probability vector tt. By hypothesis, since M is doubly 
stochastic we have (^r, = ( J,-, . . . , ^r)M and thus 

7r = (^tj • • • , ^r). Due to the matrix theorem, there exists 
some q s.t. |7r 9 — 7r| < e and the proof is established. ■ 

Since p(Y\K) is p(X q ) the method is then stego-secure. 

Let us focus now on chaos-security properties. An infor- 
mation hiding scheme S is said to have such a property if 
its iterative process has a chaotic behavior, as defined by 
Devaney, on this topological space. This problem has been 
reduced in 1111 which provides the following theorem. 

Theorem 3 Functions f : B" — > B" such that Gf is 
chaotic according to Devaney, are functions such that the 
graph r(/) is strongly connected. 

We immediatly deduce: 

Corollary 1 All the dhCI dissimulation algorithms follow- 
ing hypotheses of theorem [7J are chaos-secure. 

V. Instantiation of Steganographic Methods 

Theorem [T] relies on a u.d. strategy-adapter that is inde- 
pendent from the cover, and on an image mode /; whose it- 
eration graph T(/i) is strongly connected and whose Markov 
matrix is doubly stochastic. 

The CIIS strategy adapter IflOl has the required properties: 
it does not depend on the cover, and the proof that its outputs 
are u.d. on [1, lj is left as an exercise for the reader (a u.d. 
repartition is generated by the piecewise linear chaotic maps 
and is preserved by the iterative process). Finally, lfl2l has 
presented an iterative approach to generate image modes fi 
such that T(fi) is strongly connected. Among these maps, 



it is obvious to check which verifies or not the doubly 
stochastic constrain. 

VI. Conclusion 

This work has presented a new class of information hiding 
algorithms which generalizes algorithm [10] reduced to the 
negation mode. Its complete formalization has allowed to 
prove the stego-security and chaos security properties. As far 
as we know, this is the first time a whole class of algorithm 
has been proven to have these two properties. 

In future work, our intention is to study the robustness 
of this class of dhCI dissimulation schemes. We are to 
find the optimized parameters (modes, stretegy adapters, 
signification coefficients, iterations numbers. . . ) giving the 
strongest robustness (depending on the chosen represen- 
tation domain), theoretically and practically by realizing 
comprehensive simulations. Finally these algorithms will be 
compared to other existing ones, among other things by 
regarding whether these algorithms are chaotic or not. 
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